AtomS3R-M12 Volcengine Kit

SKU:D062-M12

Description

AtomS3R‑M12 Volcengine Kit is an IoT vision and voice development kit that pairs M5Stack hardware with Volcengine’s one-stop cloud AIGC solution. It consists of two parts: the AtomS3R‑M12 image capture unit and the Atomic Echo Base AI voice processing base. AtomS3R‑M12 provides 3 MP wide-angle video capture and edge computing, with expansion interfaces for external sensors. Atomic Echo Base integrates an audio codec, a microphone, and a speaker driver, and supports full-duplex voice wake-up, recognition, and interaction. The built-in solution from Volcengine RTC and M5Stack runs audio processing on the device (wake‑up and audio 3A: echo cancellation, noise suppression, and automatic gain control) and combines large models, speech recognition, speech synthesis, function calling, and knowledge-base technologies in the cloud, enabling smooth, natural real-time conversation between users and hardware. Typical applications include smart security, remote education, smart home, industrial monitoring, and AI robotics.

Product Features

  • Volcengine RTC real-time communication
  • AI visual recognition
  • AI voice recognition
  • Edge-to-cloud collaboration & model management
  • Integrated ESP32‑S3‑PICO‑1‑N8R8 SoC
  • 3 MP OV3660 camera (120° FOV)
  • Nine‑axis sensor system
  • Edge AI inference
  • 8 MB Flash & 8 MB PSRAM
  • Infrared emission control support
  • Expandable pins & interfaces
  • Full‑duplex I2S audio
  • 24‑bit audio codec
  • MEMS digital microphone
  • Class D amplifier (1 W @ 8 Ω speaker)
  • Development platforms
    • Arduino IDE
    • ESP‑IDF
    • PlatformIO

Includes

  • 1 x AtomS3R‑M12
  • 1 x Atomic Echo Base

Applications

  • Smart security
  • Remote education
  • Smart home
  • Industrial monitoring
  • AI tutoring
  • STEAM education

Specifications

Specification | Parameter
SoC | ESP32‑S3‑PICO‑1‑N8R8, dual‑core Xtensa LX7 @ 240 MHz, USB‑OTG
Storage | 8 MB Flash + 8 MB PSRAM
Wireless | Wi‑Fi 2.4 GHz
Cloud Stream Processing | Volcengine Stream real‑time stream access
Cloud Recognition | Face detection, target tracking, OCR text recognition, ASR speech‑to‑text
Camera | OV3660, 3 MP, F2.4 aperture, 120° FOV, 30 FPS
Infrared | IR 180° emission angle, range up to 12.46 m without obstruction
Sensor System | Nine‑axis (BMI270 + BMM150)
Interfaces | USB‑C (power/UVC plug‑and‑play), HY2.0‑4P expansion
UVC | USB Video Class plug‑and‑play
Edge AI | ESP32‑S3 + TinyML: on‑device image detection, keyword wake‑up
Audio Codec | ES8311, 24‑bit I2S, 16 kHz–64 kHz
Microphone | MEMS digital microphone, SNR ≥ 65 dB
Amplifier | NS4150B Class D
Speaker | 1 W @ 8 Ω
Communication Mode | I2S full‑duplex
Operating Temperature | 0 ~ 40 °C
Product Dimensions | AtomS3R‑M12: 26.4 × 24.0 × 22.5 mm; Atomic Echo Base: 26.4 × 24.0 × 22.5 mm
Product Weight | AtomS3R‑M12: 10.8 g; Atomic Echo Base: 10.8 g
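
As a rough sizing check for the camera and memory figures above, the following sketch estimates uncompressed frame sizes against the 8 MB PSRAM. It assumes the OV3660's full resolution is 2048 × 1536 (QXGA) and that frames are stored as RGB565 at 2 bytes per pixel; neither figure is stated on this page, and the function and constant names are illustrative rather than part of any M5Stack or Espressif API.

```cpp
#include <cstdint>

// Assumption: OV3660 full (QXGA) resolution, 2048 x 1536, roughly 3 MP.
constexpr std::uint32_t kFullWidth = 2048;
constexpr std::uint32_t kFullHeight = 1536;

// 8 MB PSRAM from the specification table, taken here as 8 MiB.
constexpr std::uint32_t kPsramBytes = 8u * 1024u * 1024u;

// Size in bytes of one uncompressed frame.
constexpr std::uint32_t rawFrameBytes(std::uint32_t width,
                                      std::uint32_t height,
                                      std::uint32_t bytesPerPixel) {
    return width * height * bytesPerPixel;
}

// Number of whole raw frames that fit in PSRAM, ignoring other PSRAM users.
constexpr std::uint32_t framesThatFit(std::uint32_t frameBytes) {
    return kPsramBytes / frameBytes;
}

// RGB565 uses 2 bytes per pixel.
constexpr std::uint32_t kRgb565Full = rawFrameBytes(kFullWidth, kFullHeight, 2);
constexpr std::uint32_t kRgb565Vga = rawFrameBytes(640, 480, 2);

static_assert(framesThatFit(kRgb565Full) == 1,
              "only one full-resolution RGB565 frame fits in 8 MiB");
```

Under these assumptions a single full-resolution RGB565 frame occupies 6 MiB, so at most one fits in PSRAM. This suggests that a compressed format or a lower resolution is the practical choice when several frame buffers are needed.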

Learn

Download Mode
To flash firmware, press and hold the reset button (for about 2 seconds) until the internal green LED lights up, then release; the device will enter download mode and wait for flashing.

Schematics

PinMap

BMI270 & IR & RGB

ESP32-S3-PICO-1-N8R8 | G0 | G45 | G47
LP5562 (RGB control chip) | SYS_SCL | SYS_SDA | -
BMI270 | SYS_SCL | SYS_SDA | -
IR | - | - | IR_LED_DRV

BMM150

BMI270 | BMI270_ASDx | BMI270_ASCx
BMM150 | A_SDA | A_SCL
The BMM150 is attached to the BMI270’s auxiliary I2C bus rather than to the main system I2C bus.
Access the BMM150 through the BMI270’s Sensor Hub auxiliary I2C interface to collect unified 9‑axis sensor data.
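
To illustrate how the auxiliary interface addresses the magnetometer, the sketch below builds the value that would be written to the BMI270's auxiliary device ID register. The register address (0x4B), the placement of the 7-bit I2C address in bits 7:1, and the BMM150 address 0x10 are assumptions taken from Bosch's datasheets and default configuration, not from this page; confirm them against the datasheets and the board before relying on them. All names are hypothetical.

```cpp
#include <cstdint>

// Assumption: BMI270 AUX_DEV_ID register address per the Bosch datasheet.
constexpr std::uint8_t kBmi270RegAuxDevId = 0x4B;

// Assumption: BMM150 at its default 7-bit I2C address on this board.
constexpr std::uint8_t kBmm150I2cAddr = 0x10;

// AUX_DEV_ID is assumed to hold the 7-bit device address in bits 7:1,
// so the address is masked to 7 bits and shifted left by one.
constexpr std::uint8_t auxDevIdValue(std::uint8_t addr7) {
    return static_cast<std::uint8_t>((addr7 & 0x7F) << 1);
}

static_assert(auxDevIdValue(kBmm150I2cAddr) == 0x20,
              "0x10 shifted into bits 7:1 gives 0x20");
```

A driver would write `auxDevIdValue(kBmm150I2cAddr)` to `kBmi270RegAuxDevId` over the system I2C bus before enabling the auxiliary interface. In practice a vendor or community BMI270 driver normally handles this step.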

OV3660 (M12)

OV3660 (M12) | ESP32-S3-PICO-1-N8R8
CAM_SDA | G12
CAM_SCL | G9
VSYNC | G10
HREF | G14
Y9 | G13
XCLK | G21
Y8 | G11
Y7 | G17
PCLK | G40
Y6 | G4
Y2 | G3
Y5 | G48
Y3 | G42
Y4 | G46
POWER_N | G18
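
The camera pin map above can be captured in code as a single constant, which keeps the wiring in one place and allows a quick sanity check. This is a minimal sketch: `CameraPins`, `kCameraPins`, and `hasDuplicatePins` are hypothetical names, and the struct is a plain container, not the configuration type of any camera driver. The GPIO numbers are taken from the table.

```cpp
#include <array>
#include <cstddef>

// Hypothetical container for the camera wiring listed in the pin map.
struct CameraPins {
    int sda;     // CAM_SDA
    int scl;     // CAM_SCL
    int vsync;   // VSYNC
    int href;    // HREF
    int pclk;    // PCLK
    int xclk;    // XCLK
    int powerN;  // POWER_N
    std::array<int, 8> data;  // Y2..Y9, in that order
};

constexpr CameraPins kCameraPins{
    12, 9, 10, 14, 40, 21, 18,
    {{3, 42, 46, 48, 4, 17, 11, 13}}  // Y2, Y3, Y4, Y5, Y6, Y7, Y8, Y9
};

// Returns true if any GPIO number appears more than once in the map.
inline bool hasDuplicatePins(const CameraPins& p) {
    const std::array<int, 15> all = {{
        p.sda, p.scl, p.vsync, p.href, p.pclk, p.xclk, p.powerN,
        p.data[0], p.data[1], p.data[2], p.data[3],
        p.data[4], p.data[5], p.data[6], p.data[7]}};
    for (std::size_t i = 0; i < all.size(); ++i) {
        for (std::size_t j = i + 1; j < all.size(); ++j) {
            if (all[i] == all[j]) return true;
        }
    }
    return false;
}
```

When using a real camera driver, these values would be copied into that driver's own pin configuration fields. The duplicate check is a cheap guard against transcription errors.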

Atomic Echo Base

Atomic Echo Base | SCL | SDA | SD/DSDIN | WS/LRCK | ASDOUT | SCK/SCLK
AtomS3R‑M12 | G39 | G38 | G5 | G6 | G7 | G8
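
As an illustration of how this wiring relates to I2S timing, the sketch below records the Atomic Echo Base pins and computes the I2S bit clock for a given audio format. The struct and function names are hypothetical, and the formula assumes standard two-slot (stereo) I2S framing, where the bit clock equals sample rate × bits per slot × number of slots.

```cpp
#include <cstdint>

// Hypothetical container for the Atomic Echo Base wiring in the table.
struct EchoBasePins {
    int i2cScl;    // SCL: codec control bus clock
    int i2cSda;    // SDA: codec control bus data
    int i2sDin;    // SD/DSDIN: audio data into the codec (playback)
    int i2sWs;     // WS/LRCK: word select
    int i2sDout;   // ASDOUT: audio data out of the codec (capture)
    int i2sBclk;   // SCK/SCLK: bit clock
};

constexpr EchoBasePins kEchoBasePins{39, 38, 5, 6, 7, 8};

// Bit clock in Hz, assuming standard I2S framing with `slots` slots per frame.
constexpr std::uint32_t i2sBitClockHz(std::uint32_t sampleRateHz,
                                      std::uint32_t bitsPerSlot,
                                      std::uint32_t slots = 2) {
    return sampleRateHz * bitsPerSlot * slots;
}

static_assert(i2sBitClockHz(16000, 16) == 512000,
              "16 kHz, 16-bit, two slots gives a 512 kHz bit clock");
```

For example, 16 kHz audio with 16-bit slots needs a 512 kHz bit clock on G8, and the same rate with 32-bit slots needs 1.024 MHz.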

HY2.0-4P

HY2.0-4P | Black | Red | Yellow | White
PORT.CUSTOM | GND | 5V | G2 | G1

Model Size

Datasheets